Statistical Guides for Literary Analysis: A Test

نویسندگان

  • A. Bookstein
  • R. Morrissey
  • S. Deerwester
  • K. Waclena
  • D. Ziff
چکیده

The existence of machine readable text makes possible the development of new techniques that assist the literary scholar in locating interesting passages of text. In this paper we explore in a preliminary manner the possibility of adapting techniques developed in the field of document retrieval to the full text context. As an alternative to the conventional Boolean logic based approaches, we consider techniques in which words are assigned weights automatically on the basis of how likely they are to distinguish useful passages from not useful passages, and in which passages containing the words with the largest weights are retrieved. In general, these techniques are iterative and take advantage of a user’s evaluation of initial retrievals. In the research reported in this paper we tested one stage of this process. Interesting passages (in our case, passages containing references to Charlemagne) were collected as a criterion corpus, and the weighting mechanism was used to assign weights to words on the basis of how well these words separated these passages from the rest of the text. The appropriateness of the weight assignment, based upon semantic considerations, is discussed, as well as new opportunities for research suggested by this study. key words: full text search, information retrieval, probabalistic retrieval, automatic information retrieval, Bayesian statistical methods: aplications to text search, weighted indexing, ARTFL, TLF, Charlemagne, Computer assisted literary research

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Effect of Lexicon-based Debates on the Felicity of Lexical Equivalents in Translating Literary Texts by Iranian EFL Learners

This study was an attempt to investigate the effect of lexicon-based debates on the felicity of lexical equivalents in translating literary texts by Iranian EFL learners.  To fulfill the purpose of this study, 59 university students, majoring in English Translation, were randomly assigned to the experimental and control groups from a total of 73 students based on their performance on a mock TOE...

متن کامل

Identifying the Ethical Competencies of International Tour Guides

Background: International tour guides, as the front line of touristschr('39') treatment of the destination community, have a great impact on tourist satisfaction and improve the quality of travel, and moral competencies are important. The purpose of this study is to identify the components of ethical competencies of international tour guides. Method: For conducting research, a mixed method bas...

متن کامل

Quantitative Analysis of Literary Styles

Writers are often viewed as having an inherent style which can serve as a literary fingerprint. By quantifying relevant features related to literary style, one may hope to classify written works and even attribute authorship to newly discovered texts. Beyond its intrinsic interest, the study of literary styles presents the opportunity to introduce and motivate many standard multivariate statist...

متن کامل

Rich Statistical Parsing and Literary Language

This thesisapplies the Data-Oriented Parsing framework in two areas:parsing & literature. The data-oriented approach rests on the assumptionthat re-use of chunks of training data can be detected and exploited attest time. Syntactic tree fragments form the common thread in the thesis.Chapter 2 presents a method to efficiently extract them from treebanks,based on heuristic...

متن کامل

Literary Analysis in the Shadow of the Critique of New-Historicism A Critical Review of Literary Analysis: The Basics‬

Abstract Hossein Payandeh translated Literary Analysis: The Basics in to Farsi on 1396, by that time the original book had been on the market for 8 months. This book includes theoretical discussions and practical evidence on theory, literary criticism, and their relation to literary analysis. Although the author has presented his analysis in three distinct chapters and in the form of three met...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007